Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 506 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 55.5 KiB |
| Average record size in memory | 112.3 B |
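The average record size follows from the two figures above it. A minimal check, assuming 1 KiB = 1024 bytes and one-decimal rounding (both assumptions, not stated in the report):

```python
# Sanity check: average record size = total memory / number of observations.
# Assumes 1 KiB = 1024 bytes; inputs are the rounded figures from the table.
total_kib = 55.5
n_rows = 506

avg_record_bytes = total_kib * 1024 / n_rows
print(f"{avg_record_bytes:.1f} B")  # prints "112.3 B"
```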
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 1 |
Alerts
| AGE is highly overall correlated with CRIM and 7 other fields | High correlation |
|---|---|
| CRIM is highly overall correlated with AGE and 8 other fields | High correlation |
| DIS is highly overall correlated with AGE and 6 other fields | High correlation |
| INDUS is highly overall correlated with AGE and 7 other fields | High correlation |
| LSTAT is highly overall correlated with AGE and 7 other fields | High correlation |
| MEDV is highly overall correlated with AGE and 7 other fields | High correlation |
| NOX is highly overall correlated with AGE and 8 other fields | High correlation |
| PTRATIO is highly overall correlated with MEDV | High correlation |
| RAD is highly overall correlated with CRIM and 2 other fields | High correlation |
| RM is highly overall correlated with LSTAT and 1 other field | High correlation |
| TAX is highly overall correlated with AGE and 7 other fields | High correlation |
| ZN is highly overall correlated with AGE and 4 other fields | High correlation |
| CHAS is highly imbalanced (62.7%) | Imbalance |
| ZN has 360 (71.1%) zeros | Zeros |
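The Zeros and Imbalance percentages can be checked against the counts reported elsewhere in this report. The sketch below assumes the imbalance figure is one minus the normalized Shannon entropy of the class counts. That formula is an assumption that happens to reproduce 62.7% from the CHAS counts (452, 34, 20); it is not quoted from the profiler. The helper names are hypothetical.

```python
import math
from typing import Sequence


def zero_share(zero_count: int, n_rows: int) -> float:
    """Percentage of rows whose value is exactly zero."""
    return 100.0 * zero_count / n_rows


def imbalance_score(counts: Sequence[int]) -> float:
    """Assumed score: 1 - H / log2(k); 0 means balanced, values near 1 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # illustrative choice: a single class is treated as not imbalanced
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


zn_zero_pct = zero_share(360, 506)            # ZN: 360 zeros out of 506 rows
chas_imbalance = imbalance_score([452, 34, 20])  # CHAS class counts

print(f"ZN zeros: {zn_zero_pct:.1f}%")               # prints "ZN zeros: 71.1%"
print(f"CHAS imbalance: {100 * chas_imbalance:.1f}%")  # prints "CHAS imbalance: 62.7%"
```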
Reproduction
| Analysis started | 2025-01-26 19:58:27.448103 |
|---|---|
| Analysis finished | 2025-01-26 19:58:47.609393 |
| Duration | 20.16 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
Variables
CRIM
Real number (ℝ)
High correlation 
| Distinct | 406 |
|---|---|
| Distinct (%) | 80.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9463849 |
| Minimum | 0.00632 |
| Maximum | 6.9184395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.00632 |
|---|---|
| 5-th percentile | 0.02791 |
| Q1 | 0.083235 |
| median | 0.29025 |
| Q3 | 3.611874 |
| 95-th percentile | 6.9184395 |
| Maximum | 6.9184395 |
| Range | 6.9121195 |
| Interquartile range (IQR) | 3.528639 |
Descriptive statistics
| Standard deviation | 2.6332452 |
|---|---|
| Coefficient of variation (CV) | 1.3528903 |
| Kurtosis | -0.5606899 |
| Mean | 1.9463849 |
| Median Absolute Deviation (MAD) | 0.255695 |
| Skewness | 1.0694454 |
| Sum | 984.87075 |
| Variance | 6.9339804 |
| Monotonicity | Not monotonic |
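Several of the figures above are derived from others in the same tables. As an illustration using the CRIM values (the same relations should hold for every numeric variable in this report), a short sketch:

```python
# Values copied from the CRIM quantile and descriptive statistics tables.
minimum, maximum = 0.00632, 6.9184395
q1, q3 = 0.083235, 3.611874
mean, std = 1.9463849, 2.6332452

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared

print(f"range={value_range:.7f} iqr={iqr:.6f} cv={cv:.4f} variance={variance:.4f}")
```

Because the inputs are already rounded, the recomputed values agree with the table only to within rounding error.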
| Value | Count | Frequency (%) |
| 6.9184395 | 81 | 16.0% |
| 3.611873971 | 20 | 4.0% |
| 0.01501 | 2 | 0.4% |
| 0.08829 | 1 | 0.2% |
| 0.04741 | 1 | 0.2% |
| 0.00632 | 1 | 0.2% |
| 0.02731 | 1 | 0.2% |
| 4.34879 | 1 | 0.2% |
| 4.03841 | 1 | 0.2% |
| 3.56868 | 1 | 0.2% |
| Other values (396) | 396 | 78.3% |
| Value | Count | Frequency (%) |
| 0.00632 | 1 | 0.2% |
| 0.00906 | 1 | 0.2% |
| 0.01096 | 1 | 0.2% |
| 0.01301 | 1 | 0.2% |
| 0.01311 | 1 | 0.2% |
| 0.0136 | 1 | 0.2% |
| 0.01381 | 1 | 0.2% |
| 0.01432 | 1 | 0.2% |
| 0.01439 | 1 | 0.2% |
| 0.01501 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 6.9184395 | 81 | 16.0% |
| 6.80117 | 1 | 0.2% |
| 6.71772 | 1 | 0.2% |
| 6.65492 | 1 | 0.2% |
| 6.53876 | 1 | 0.2% |
| 6.44405 | 1 | 0.2% |
| 6.39312 | 1 | 0.2% |
| 6.28807 | 1 | 0.2% |
| 5.87205 | 1 | 0.2% |
| 5.82401 | 1 | 0.2% |
ZN
Real number (ℝ)
High correlation  Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5232559 |
| Minimum | 0 |
| Maximum | 28.029835 |
| Zeros | 360 |
| Zeros (%) | 71.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 11.211934 |
| 95-th percentile | 28.029835 |
| Maximum | 28.029835 |
| Range | 28.029835 |
| Interquartile range (IQR) | 11.211934 |
Descriptive statistics
| Standard deviation | 10.810961 |
|---|---|
| Coefficient of variation (CV) | 1.6572952 |
| Kurtosis | -0.34015755 |
| Mean | 6.5232559 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2029836 |
| Sum | 3300.7675 |
| Variance | 116.87687 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 360 | 71.1% |
| 28.029835 | 68 | 13.4% |
| 11.21193416 | 20 | 4.0% |
| 20 | 20 | 4.0% |
| 12.5 | 10 | 2.0% |
| 22 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 21 | 4 | 0.8% |
| 28 | 2 | 0.4% |
| 18 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 360 | 71.1% |
| 11.21193416 | 20 | 4.0% |
| 12.5 | 10 | 2.0% |
| 17.5 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 20 | 20 | 4.0% |
| 21 | 4 | 0.8% |
| 22 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 28 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 28.029835 | 68 | 13.4% |
| 28 | 2 | 0.4% |
| 25 | 10 | 2.0% |
| 22 | 10 | 2.0% |
| 21 | 4 | 0.8% |
| 20 | 20 | 4.0% |
| 18 | 1 | 0.2% |
| 17.5 | 1 | 0.2% |
| 12.5 | 10 | 2.0% |
| 11.21193416 | 20 | 4.0% |
INDUS
Real number (ℝ)
High correlation 
| Distinct | 77 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.083992 |
| Minimum | 0.46 |
| Maximum | 27.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 2.18 |
| Q1 | 5.19 |
| median | 9.9 |
| Q3 | 18.1 |
| 95-th percentile | 19.58 |
| Maximum | 27.74 |
| Range | 27.28 |
| Interquartile range (IQR) | 12.91 |
Descriptive statistics
| Standard deviation | 6.6991648 |
|---|---|
| Coefficient of variation (CV) | 0.60440001 |
| Kurtosis | -1.1439136 |
| Mean | 11.083992 |
| Median Absolute Deviation (MAD) | 6.46 |
| Skewness | 0.30987063 |
| Sum | 5608.4998 |
| Variance | 44.878808 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18.1 | 127 | 25.1% |
| 19.58 | 28 | 5.5% |
| 8.14 | 22 | 4.3% |
| 11.08399177 | 20 | 4.0% |
| 6.2 | 18 | 3.6% |
| 21.89 | 14 | 2.8% |
| 3.97 | 12 | 2.4% |
| 9.9 | 12 | 2.4% |
| 10.59 | 11 | 2.2% |
| 8.56 | 11 | 2.2% |
| Other values (67) | 231 | 45.7% |
| Value | Count | Frequency (%) |
| 0.46 | 1 | 0.2% |
| 0.74 | 1 | 0.2% |
| 1.21 | 1 | 0.2% |
| 1.22 | 1 | 0.2% |
| 1.25 | 2 | 0.4% |
| 1.32 | 1 | 0.2% |
| 1.38 | 1 | 0.2% |
| 1.47 | 2 | 0.4% |
| 1.52 | 4 | 0.8% |
| 1.69 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 27.74 | 5 | 1.0% |
| 25.65 | 6 | 1.2% |
| 21.89 | 14 | 2.8% |
| 19.58 | 28 | 5.5% |
| 18.1 | 127 | 25.1% |
| 15.04 | 3 | 0.6% |
| 13.92 | 4 | 0.8% |
| 13.89 | 3 | 0.6% |
| 12.83 | 6 | 1.2% |
| 11.93 | 5 | 1.0% |
CHAS
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| 0.0 | 452 |
|---|---|
| 1.0 | 34 |
| 0.06995884773662552 | 20 |
Length
| Max length | 19 |
|---|---|
| Median length | 3 |
| Mean length | 3.6324111 |
| Min length | 3 |
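Because CHAS is profiled as a categorical (string) column, the length statistics describe the character length of each label. The sketch below rebuilds them from the CHAS label counts reported in this section (452, 34 and 20 rows); treating the labels as the literal strings shown is an assumption.

```python
import statistics

# CHAS labels and their row counts, as listed in this section.
counts = {"0.0": 452, "1.0": 34, "0.06995884773662552": 20}

# One length entry per row: each label's length repeated by its count.
lengths = sorted(
    length
    for label, count in counts.items()
    for length in [len(label)] * count
)

max_length = lengths[-1]
min_length = lengths[0]
median_length = statistics.median(lengths)
mean_length = sum(lengths) / len(lengths)

print(max_length, median_length, round(mean_length, 7), min_length)
```

The single long label (19 characters, 20 rows) is what pulls the mean length above the median of 3.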
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 452 | 89.3% |
| 1.0 | 34 | 6.7% |
| 0.06995884773662552 | 20 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 978 | 53.2% |
| . | 506 | 27.5% |
| 6 | 60 | 3.3% |
| 5 | 60 | 3.3% |
| 7 | 40 | 2.2% |
| 9 | 40 | 2.2% |
| 8 | 40 | 2.2% |
| 2 | 40 | 2.2% |
| 1 | 34 | 1.8% |
| 4 | 20 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1838 | 100.0% |
NOX
Real number (ℝ)
High correlation 
| Distinct | 81 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55469506 |
| Minimum | 0.385 |
| Maximum | 0.871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.385 |
|---|---|
| 5-th percentile | 0.40925 |
| Q1 | 0.449 |
| median | 0.538 |
| Q3 | 0.624 |
| 95-th percentile | 0.74 |
| Maximum | 0.871 |
| Range | 0.486 |
| Interquartile range (IQR) | 0.175 |
Descriptive statistics
| Standard deviation | 0.11587768 |
|---|---|
| Coefficient of variation (CV) | 0.20890339 |
| Kurtosis | -0.064667133 |
| Mean | 0.55469506 |
| Median Absolute Deviation (MAD) | 0.0875 |
| Skewness | 0.72930792 |
| Sum | 280.6757 |
| Variance | 0.013427636 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.538 | 23 | 4.5% |
| 0.713 | 18 | 3.6% |
| 0.437 | 17 | 3.4% |
| 0.871 | 16 | 3.2% |
| 0.624 | 15 | 3.0% |
| 0.489 | 15 | 3.0% |
| 0.693 | 14 | 2.8% |
| 0.605 | 14 | 2.8% |
| 0.74 | 13 | 2.6% |
| 0.544 | 12 | 2.4% |
| Other values (71) | 349 | 69.0% |
| Value | Count | Frequency (%) |
| 0.385 | 1 | 0.2% |
| 0.389 | 1 | 0.2% |
| 0.392 | 2 | 0.4% |
| 0.394 | 1 | 0.2% |
| 0.398 | 2 | 0.4% |
| 0.4 | 4 | 0.8% |
| 0.401 | 3 | 0.6% |
| 0.403 | 3 | 0.6% |
| 0.404 | 3 | 0.6% |
| 0.405 | 3 | 0.6% |
| Value | Count | Frequency (%) |
| 0.871 | 16 | 3.2% |
| 0.77 | 8 | 1.6% |
| 0.74 | 13 | 2.6% |
| 0.718 | 6 | 1.2% |
| 0.713 | 18 | 3.6% |
| 0.7 | 11 | 2.2% |
| 0.693 | 14 | 2.8% |
| 0.679 | 8 | 1.6% |
| 0.671 | 7 | 1.4% |
| 0.668 | 3 | 0.6% |
RM
Real number (ℝ)
High correlation 
| Distinct | 446 |
|---|---|
| Distinct (%) | 88.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2846344 |
| Minimum | 3.561 |
| Maximum | 8.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 3.561 |
|---|---|
| 5-th percentile | 5.314 |
| Q1 | 5.8855 |
| median | 6.2085 |
| Q3 | 6.6235 |
| 95-th percentile | 7.5875 |
| Maximum | 8.78 |
| Range | 5.219 |
| Interquartile range (IQR) | 0.738 |
Descriptive statistics
| Standard deviation | 0.70261714 |
|---|---|
| Coefficient of variation (CV) | 0.11179921 |
| Kurtosis | 1.8915004 |
| Mean | 6.2846344 |
| Median Absolute Deviation (MAD) | 0.3455 |
| Skewness | 0.40361213 |
| Sum | 3180.025 |
| Variance | 0.49367085 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.229 | 3 | 0.6% |
| 6.127 | 3 | 0.6% |
| 5.713 | 3 | 0.6% |
| 6.417 | 3 | 0.6% |
| 6.405 | 3 | 0.6% |
| 6.167 | 3 | 0.6% |
| 5.304 | 2 | 0.4% |
| 5.39 | 2 | 0.4% |
| 6.193 | 2 | 0.4% |
| 4.138 | 2 | 0.4% |
| Other values (436) | 480 | 94.9% |
| Value | Count | Frequency (%) |
| 3.561 | 1 | 0.2% |
| 3.863 | 1 | 0.2% |
| 4.138 | 2 | 0.4% |
| 4.368 | 1 | 0.2% |
| 4.519 | 1 | 0.2% |
| 4.628 | 1 | 0.2% |
| 4.652 | 1 | 0.2% |
| 4.88 | 1 | 0.2% |
| 4.903 | 1 | 0.2% |
| 4.906 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 8.78 | 1 | 0.2% |
| 8.725 | 1 | 0.2% |
| 8.704 | 1 | 0.2% |
| 8.398 | 1 | 0.2% |
| 8.375 | 1 | 0.2% |
| 8.337 | 1 | 0.2% |
| 8.297 | 1 | 0.2% |
| 8.266 | 1 | 0.2% |
| 8.259 | 1 | 0.2% |
| 8.247 | 1 | 0.2% |
AGE
Real number (ℝ)
High correlation 
| Distinct | 349 |
|---|---|
| Distinct (%) | 69.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.518519 |
| Minimum | 2.9 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 2.9 |
|---|---|
| 5-th percentile | 18.4 |
| Q1 | 45.925 |
| median | 74.45 |
| Q3 | 93.575 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 97.1 |
| Interquartile range (IQR) | 47.65 |
Descriptive statistics
| Standard deviation | 27.439466 |
|---|---|
| Coefficient of variation (CV) | 0.40046788 |
| Kurtosis | -0.89845748 |
| Mean | 68.518519 |
| Median Absolute Deviation (MAD) | 20.9 |
| Skewness | -0.59426137 |
| Sum | 34670.37 |
| Variance | 752.9243 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 42 | 8.3% |
| 68.51851852 | 20 | 4.0% |
| 97.9 | 4 | 0.8% |
| 96 | 4 | 0.8% |
| 87.9 | 4 | 0.8% |
| 98.8 | 4 | 0.8% |
| 95.4 | 4 | 0.8% |
| 76.5 | 3 | 0.6% |
| 32.2 | 3 | 0.6% |
| 36.6 | 3 | 0.6% |
| Other values (339) | 415 | 82.0% |
| Value | Count | Frequency (%) |
| 2.9 | 1 | 0.2% |
| 6.2 | 1 | 0.2% |
| 6.5 | 1 | 0.2% |
| 6.6 | 2 | 0.4% |
| 6.8 | 1 | 0.2% |
| 7.8 | 2 | 0.4% |
| 8.4 | 1 | 0.2% |
| 8.9 | 1 | 0.2% |
| 9.8 | 1 | 0.2% |
| 10 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 100 | 42 | 8.3% |
| 99.3 | 1 | 0.2% |
| 99.1 | 1 | 0.2% |
| 98.9 | 3 | 0.6% |
| 98.8 | 4 | 0.8% |
| 98.7 | 1 | 0.2% |
| 98.5 | 1 | 0.2% |
| 98.4 | 2 | 0.4% |
| 98.3 | 2 | 0.4% |
| 98.2 | 2 | 0.4% |
DIS
Real number (ℝ)
High correlation 
| Distinct | 412 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7950427 |
| Minimum | 1.1296 |
| Maximum | 12.1265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.1296 |
|---|---|
| 5-th percentile | 1.461975 |
| Q1 | 2.100175 |
| median | 3.20745 |
| Q3 | 5.188425 |
| 95-th percentile | 7.8278 |
| Maximum | 12.1265 |
| Range | 10.9969 |
| Interquartile range (IQR) | 3.08825 |
Descriptive statistics
| Standard deviation | 2.1057101 |
|---|---|
| Coefficient of variation (CV) | 0.55485809 |
| Kurtosis | 0.48794112 |
| Mean | 3.7950427 |
| Median Absolute Deviation (MAD) | 1.29115 |
| Skewness | 1.0117806 |
| Sum | 1920.2916 |
| Variance | 4.4340151 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.4952 | 5 | 1.0% |
| 5.4007 | 4 | 0.8% |
| 5.2873 | 4 | 0.8% |
| 5.7209 | 4 | 0.8% |
| 6.8147 | 4 | 0.8% |
| 6.0622 | 3 | 0.6% |
| 6.4798 | 3 | 0.6% |
| 3.6519 | 3 | 0.6% |
| 7.309 | 3 | 0.6% |
| 6.498 | 3 | 0.6% |
| Other values (402) | 470 | 92.9% |
| Value | Count | Frequency (%) |
| 1.1296 | 1 | 0.2% |
| 1.137 | 1 | 0.2% |
| 1.1691 | 1 | 0.2% |
| 1.1742 | 1 | 0.2% |
| 1.1781 | 1 | 0.2% |
| 1.2024 | 1 | 0.2% |
| 1.2852 | 1 | 0.2% |
| 1.3163 | 1 | 0.2% |
| 1.3216 | 1 | 0.2% |
| 1.3325 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 12.1265 | 1 | 0.2% |
| 10.7103 | 2 | 0.4% |
| 10.5857 | 2 | 0.4% |
| 9.2229 | 1 | 0.2% |
| 9.2203 | 2 | 0.4% |
| 9.1876 | 1 | 0.2% |
| 9.0892 | 1 | 0.2% |
| 8.9067 | 2 | 0.4% |
| 8.7921 | 2 | 0.4% |
| 8.6966 | 1 | 0.2% |
RAD
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.5494071 |
| Minimum | 1 |
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 24 |
| 95-th percentile | 24 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 8.7072594 |
|---|---|
| Coefficient of variation (CV) | 0.91181152 |
| Kurtosis | -0.86723199 |
| Mean | 9.5494071 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.0048146 |
| Sum | 4832 |
| Variance | 75.816366 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 132 | 26.1% |
| 5 | 115 | 22.7% |
| 4 | 110 | 21.7% |
| 3 | 38 | 7.5% |
| 6 | 26 | 5.1% |
| 8 | 24 | 4.7% |
| 2 | 24 | 4.7% |
| 1 | 20 | 4.0% |
| 7 | 17 | 3.4% |
| Value | Count | Frequency (%) |
| 1 | 20 | 4.0% |
| 2 | 24 | 4.7% |
| 3 | 38 | 7.5% |
| 4 | 110 | 21.7% |
| 5 | 115 | 22.7% |
| 6 | 26 | 5.1% |
| 7 | 17 | 3.4% |
| 8 | 24 | 4.7% |
| 24 | 132 | 26.1% |
| Value | Count | Frequency (%) |
| 24 | 132 | 26.1% |
| 8 | 24 | 4.7% |
| 7 | 17 | 3.4% |
| 6 | 26 | 5.1% |
| 5 | 115 | 22.7% |
| 4 | 110 | 21.7% |
| 3 | 38 | 7.5% |
| 2 | 24 | 4.7% |
| 1 | 20 | 4.0% |
TAX
Real number (ℝ)
High correlation 
| Distinct | 66 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 408.23715 |
| Minimum | 187 |
| Maximum | 711 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 187 |
|---|---|
| 5-th percentile | 222 |
| Q1 | 279 |
| median | 330 |
| Q3 | 666 |
| 95-th percentile | 666 |
| Maximum | 711 |
| Range | 524 |
| Interquartile range (IQR) | 387 |
Descriptive statistics
| Standard deviation | 168.53712 |
|---|---|
| Coefficient of variation (CV) | 0.4128412 |
| Kurtosis | -1.142408 |
| Mean | 408.23715 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.66995594 |
| Sum | 206568 |
| Variance | 28404.759 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 666 | 132 | 26.1% |
| 307 | 40 | 7.9% |
| 403 | 30 | 5.9% |
| 437 | 15 | 3.0% |
| 304 | 14 | 2.8% |
| 264 | 12 | 2.4% |
| 398 | 12 | 2.4% |
| 384 | 11 | 2.2% |
| 277 | 11 | 2.2% |
| 330 | 10 | 2.0% |
| Other values (56) | 219 | 43.3% |
| Value | Count | Frequency (%) |
| 187 | 1 | 0.2% |
| 188 | 7 | 1.4% |
| 193 | 8 | 1.6% |
| 198 | 1 | 0.2% |
| 216 | 5 | 1.0% |
| 222 | 7 | 1.4% |
| 223 | 5 | 1.0% |
| 224 | 10 | 2.0% |
| 226 | 1 | 0.2% |
| 233 | 9 | 1.8% |
| Value | Count | Frequency (%) |
| 711 | 5 | 1.0% |
| 666 | 132 | 26.1% |
| 469 | 1 | 0.2% |
| 437 | 15 | 3.0% |
| 432 | 9 | 1.8% |
| 430 | 3 | 0.6% |
| 422 | 1 | 0.2% |
| 411 | 2 | 0.4% |
| 403 | 30 | 5.9% |
| 402 | 2 | 0.4% |
PTRATIO
Real number (ℝ)
High correlation 
| Distinct | 46 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.455534 |
| Minimum | 12.6 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 12.6 |
|---|---|
| 5-th percentile | 14.7 |
| Q1 | 17.4 |
| median | 19.05 |
| Q3 | 20.2 |
| 95-th percentile | 21 |
| Maximum | 22 |
| Range | 9.4 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.1649455 |
|---|---|
| Coefficient of variation (CV) | 0.11730604 |
| Kurtosis | -0.28509138 |
| Mean | 18.455534 |
| Median Absolute Deviation (MAD) | 1.15 |
| Skewness | -0.80232493 |
| Sum | 9338.5 |
| Variance | 4.6869891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.2 | 140 | 27.7% |
| 14.7 | 34 | 6.7% |
| 21 | 27 | 5.3% |
| 17.8 | 23 | 4.5% |
| 19.2 | 19 | 3.8% |
| 17.4 | 18 | 3.6% |
| 19.1 | 17 | 3.4% |
| 18.6 | 17 | 3.4% |
| 18.4 | 16 | 3.2% |
| 16.6 | 16 | 3.2% |
| Other values (36) | 179 | 35.4% |
| Value | Count | Frequency (%) |
| 12.6 | 3 | 0.6% |
| 13 | 12 | 2.4% |
| 13.6 | 1 | 0.2% |
| 14.4 | 1 | 0.2% |
| 14.7 | 34 | 6.7% |
| 14.8 | 3 | 0.6% |
| 14.9 | 4 | 0.8% |
| 15.1 | 1 | 0.2% |
| 15.2 | 13 | 2.6% |
| 15.3 | 3 | 0.6% |
| Value | Count | Frequency (%) |
| 22 | 2 | 0.4% |
| 21.2 | 15 | 3.0% |
| 21.1 | 1 | 0.2% |
| 21 | 27 | 5.3% |
| 20.9 | 11 | 2.2% |
| 20.2 | 140 | 27.7% |
| 20.1 | 5 | 1.0% |
| 19.7 | 8 | 1.6% |
| 19.6 | 8 | 1.6% |
| 19.2 | 19 | 3.8% |
B
Real number (ℝ)
| Distinct | 282 |
|---|---|
| Distinct (%) | 55.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 381.91884 |
| Minimum | 344.10625 |
| Maximum | 396.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 344.10625 |
|---|---|
| 5-th percentile | 344.10625 |
| Q1 | 375.3775 |
| median | 391.44 |
| Q3 | 396.225 |
| 95-th percentile | 396.9 |
| Maximum | 396.9 |
| Range | 52.79375 |
| Interquartile range (IQR) | 20.8475 |
Descriptive statistics
| Standard deviation | 19.054913 |
|---|---|
| Coefficient of variation (CV) | 0.049892571 |
| Kurtosis | -0.23058954 |
| Mean | 381.91884 |
| Median Absolute Deviation (MAD) | 5.46 |
| Skewness | -1.1642076 |
| Sum | 193250.93 |
| Variance | 363.0897 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 396.9 | 121 | 23.9% |
| 344.10625 | 77 | 15.2% |
| 395.24 | 3 | 0.6% |
| 393.74 | 3 | 0.6% |
| 389.71 | 2 | 0.4% |
| 394.12 | 2 | 0.4% |
| 395.6 | 2 | 0.4% |
| 388.45 | 2 | 0.4% |
| 392.78 | 2 | 0.4% |
| 395.11 | 2 | 0.4% |
| Other values (272) | 290 | 57.3% |
| Value | Count | Frequency (%) |
| 344.10625 | 77 | 15.2% |
| 344.91 | 1 | 0.2% |
| 347.88 | 1 | 0.2% |
| 348.13 | 1 | 0.2% |
| 348.93 | 1 | 0.2% |
| 349.48 | 1 | 0.2% |
| 350.45 | 1 | 0.2% |
| 350.65 | 1 | 0.2% |
| 351.85 | 1 | 0.2% |
| 352.58 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 396.9 | 121 | 23.9% |
| 396.42 | 1 | 0.2% |
| 396.33 | 1 | 0.2% |
| 396.3 | 1 | 0.2% |
| 396.28 | 1 | 0.2% |
| 396.24 | 1 | 0.2% |
| 396.23 | 1 | 0.2% |
| 396.21 | 2 | 0.4% |
| 396.14 | 1 | 0.2% |
| 396.06 | 2 | 0.4% |
LSTAT
Real number (ℝ)
High correlation 
| Distinct | 439 |
|---|---|
| Distinct (%) | 86.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.715432 |
| Minimum | 1.73 |
| Maximum | 37.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.73 |
|---|---|
| 5-th percentile | 3.7375 |
| Q1 | 7.23 |
| median | 11.995 |
| Q3 | 16.57 |
| 95-th percentile | 26.8075 |
| Maximum | 37.97 |
| Range | 36.24 |
| Interquartile range (IQR) | 9.34 |
Descriptive statistics
| Standard deviation | 7.0127389 |
|---|---|
| Coefficient of variation (CV) | 0.55151401 |
| Kurtosis | 0.6634895 |
| Mean | 12.715432 |
| Median Absolute Deviation (MAD) | 4.695 |
| Skewness | 0.92729111 |
| Sum | 6434.0086 |
| Variance | 49.178507 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.7154321 | 20 | 4.0% |
| 8.05 | 3 | 0.6% |
| 14.1 | 3 | 0.6% |
| 6.36 | 3 | 0.6% |
| 7.79 | 3 | 0.6% |
| 18.13 | 3 | 0.6% |
| 3.11 | 2 | 0.4% |
| 4.56 | 2 | 0.4% |
| 6.72 | 2 | 0.4% |
| 7.6 | 2 | 0.4% |
| Other values (429) | 463 | 91.5% |
| Value | Count | Frequency (%) |
| 1.73 | 1 | 0.2% |
| 1.92 | 1 | 0.2% |
| 1.98 | 1 | 0.2% |
| 2.47 | 1 | 0.2% |
| 2.87 | 1 | 0.2% |
| 2.88 | 1 | 0.2% |
| 2.94 | 1 | 0.2% |
| 2.96 | 1 | 0.2% |
| 2.97 | 1 | 0.2% |
| 2.98 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 37.97 | 1 | 0.2% |
| 36.98 | 1 | 0.2% |
| 34.77 | 1 | 0.2% |
| 34.41 | 1 | 0.2% |
| 34.37 | 1 | 0.2% |
| 34.02 | 1 | 0.2% |
| 31.99 | 1 | 0.2% |
| 30.81 | 2 | 0.4% |
| 30.63 | 1 | 0.2% |
| 30.62 | 1 | 0.2% |
MEDV
Real number (ℝ)
High correlation 
| Distinct | 229 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.532806 |
| Minimum | 5 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 10.2 |
| Q1 | 17.025 |
| median | 21.2 |
| Q3 | 25 |
| 95-th percentile | 43.4 |
| Maximum | 50 |
| Range | 45 |
| Interquartile range (IQR) | 7.975 |
Descriptive statistics
| Standard deviation | 9.1971041 |
|---|---|
| Coefficient of variation (CV) | 0.40816505 |
| Kurtosis | 1.4951969 |
| Mean | 22.532806 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.1080984 |
| Sum | 11401.6 |
| Variance | 84.586724 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 25 | 8 | 1.6% |
| 23.1 | 7 | 1.4% |
| 22 | 7 | 1.4% |
| 21.7 | 7 | 1.4% |
| 20.6 | 6 | 1.2% |
| 19.4 | 6 | 1.2% |
| 22.6 | 5 | 1.0% |
| 21.4 | 5 | 1.0% |
| 21.2 | 5 | 1.0% |
| Other values (219) | 434 | 85.8% |
| Value | Count | Frequency (%) |
| 5 | 2 | 0.4% |
| 5.6 | 1 | 0.2% |
| 6.3 | 1 | 0.2% |
| 7 | 2 | 0.4% |
| 7.2 | 3 | 0.6% |
| 7.4 | 1 | 0.2% |
| 7.5 | 1 | 0.2% |
| 8.1 | 1 | 0.2% |
| 8.3 | 2 | 0.4% |
| 8.4 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 48.8 | 1 | 0.2% |
| 48.5 | 1 | 0.2% |
| 48.3 | 1 | 0.2% |
| 46.7 | 1 | 0.2% |
| 46 | 1 | 0.2% |
| 45.4 | 1 | 0.2% |
| 44.8 | 1 | 0.2% |
| 44 | 1 | 0.2% |
| 43.8 | 1 | 0.2% |
Interactions
Correlations
| | AGE | B | CHAS | CRIM | DIS | INDUS | LSTAT | MEDV | NOX | PTRATIO | RAD | RM | TAX | ZN |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AGE | 1.000 | -0.222 | 0.000 | 0.661 | -0.779 | 0.646 | 0.630 | -0.543 | 0.774 | 0.355 | 0.413 | -0.271 | 0.518 | -0.510 |
| B | -0.222 | 1.000 | 0.000 | -0.342 | 0.249 | -0.277 | -0.204 | 0.179 | -0.295 | -0.065 | -0.275 | 0.060 | -0.326 | 0.142 |
| CHAS | 0.000 | 0.000 | 1.000 | 0.114 | 0.037 | 0.080 | 0.000 | 0.140 | 0.095 | 0.088 | 0.102 | 0.096 | 0.028 | 0.000 |
| CRIM | 0.661 | -0.342 | 0.114 | 1.000 | -0.707 | 0.686 | 0.588 | -0.531 | 0.790 | 0.433 | 0.732 | -0.284 | 0.724 | -0.502 |
| DIS | -0.779 | 0.249 | 0.037 | -0.707 | 1.000 | -0.751 | -0.549 | 0.446 | -0.880 | -0.322 | -0.496 | 0.263 | -0.574 | 0.586 |
| INDUS | 0.646 | -0.277 | 0.080 | 0.686 | -0.751 | 1.000 | 0.606 | -0.568 | 0.776 | 0.429 | 0.470 | -0.397 | 0.662 | -0.600 |
| LSTAT | 0.630 | -0.204 | 0.000 | 0.588 | -0.549 | 0.606 | 1.000 | -0.831 | 0.620 | 0.466 | 0.376 | -0.621 | 0.520 | -0.470 |
| MEDV | -0.543 | 0.179 | 0.140 | -0.531 | 0.446 | -0.568 | -0.831 | 1.000 | -0.563 | -0.556 | -0.347 | 0.634 | -0.562 | 0.430 |
| NOX | 0.774 | -0.295 | 0.095 | 0.790 | -0.880 | 0.776 | 0.620 | -0.563 | 1.000 | 0.391 | 0.586 | -0.310 | 0.650 | -0.605 |
| PTRATIO | 0.355 | -0.065 | 0.088 | 0.433 | -0.322 | 0.429 | 0.466 | -0.556 | 0.391 | 1.000 | 0.318 | -0.313 | 0.453 | -0.458 |
| RAD | 0.413 | -0.275 | 0.102 | 0.732 | -0.496 | 0.470 | 0.376 | -0.347 | 0.586 | 0.318 | 1.000 | -0.107 | 0.705 | -0.265 |
| RM | -0.271 | 0.060 | 0.096 | -0.284 | 0.263 | -0.397 | -0.621 | 0.634 | -0.310 | -0.313 | -0.107 | 1.000 | -0.272 | 0.353 |
| TAX | 0.518 | -0.326 | 0.028 | 0.724 | -0.574 | 0.662 | 0.520 | -0.562 | 0.650 | 0.453 | 0.705 | -0.272 | 1.000 | -0.362 |
| ZN | -0.510 | 0.142 | 0.000 | -0.502 | 0.586 | -0.600 | -0.470 | 0.430 | -0.605 | -0.458 | -0.265 | 0.353 | -0.362 | 1.000 |
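The High correlation alerts in the overview appear to follow from this matrix. The sketch below assumes a cutoff of |r| ≥ 0.5; that threshold is inferred from the alerts rather than stated in this report, and the helper name is hypothetical. It reproduces the alert counts for three rows copied from the matrix.

```python
HIGH_CORR_THRESHOLD = 0.5  # assumed cutoff, inferred from the alerts, not stated in the report

# Three rows copied from the correlation matrix (self-correlations omitted).
correlations = {
    "PTRATIO": {"AGE": 0.355, "B": -0.065, "CHAS": 0.088, "CRIM": 0.433,
                "DIS": -0.322, "INDUS": 0.429, "LSTAT": 0.466, "MEDV": -0.556,
                "NOX": 0.391, "RAD": 0.318, "RM": -0.313, "TAX": 0.453, "ZN": -0.458},
    "RM": {"AGE": -0.271, "B": 0.060, "CHAS": 0.096, "CRIM": -0.284,
           "DIS": 0.263, "INDUS": -0.397, "LSTAT": -0.621, "MEDV": 0.634,
           "NOX": -0.310, "PTRATIO": -0.313, "RAD": -0.107, "TAX": -0.272, "ZN": 0.353},
    "RAD": {"AGE": 0.413, "B": -0.275, "CHAS": 0.102, "CRIM": 0.732,
            "DIS": -0.496, "INDUS": 0.470, "LSTAT": 0.376, "MEDV": -0.347,
            "NOX": 0.586, "PTRATIO": 0.318, "RM": -0.107, "TAX": 0.705, "ZN": -0.265},
}


def highly_correlated(row: dict) -> list:
    """Names of fields whose absolute correlation meets the assumed threshold."""
    return sorted(name for name, r in row.items() if abs(r) >= HIGH_CORR_THRESHOLD)


for field, row in correlations.items():
    print(field, "->", highly_correlated(row))
```

Under this assumption PTRATIO is flagged only against MEDV, RM against LSTAT and MEDV, and RAD against CRIM, NOX and TAX, which matches the alert text for those fields.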
Missing values
Sample
First rows
| | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | MEDV |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.00632 | 18.0 | 2.31 | 0.000000 | 0.538 | 6.575 | 65.2 | 4.0900 | 1 | 296 | 15.3 | 396.90 | 4.980000 | 24.0 |
| 1 | 0.02731 | 0.0 | 7.07 | 0.000000 | 0.469 | 6.421 | 78.9 | 4.9671 | 2 | 242 | 17.8 | 396.90 | 9.140000 | 21.6 |
| 2 | 0.02729 | 0.0 | 7.07 | 0.000000 | 0.469 | 7.185 | 61.1 | 4.9671 | 2 | 242 | 17.8 | 392.83 | 4.030000 | 34.7 |
| 3 | 0.03237 | 0.0 | 2.18 | 0.000000 | 0.458 | 6.998 | 45.8 | 6.0622 | 3 | 222 | 18.7 | 394.63 | 2.940000 | 33.4 |
| 4 | 0.06905 | 0.0 | 2.18 | 0.000000 | 0.458 | 7.147 | 54.2 | 6.0622 | 3 | 222 | 18.7 | 396.90 | 12.715432 | 36.2 |
| 5 | 0.02985 | 0.0 | 2.18 | 0.000000 | 0.458 | 6.430 | 58.7 | 6.0622 | 3 | 222 | 18.7 | 394.12 | 5.210000 | 28.7 |
| 6 | 0.08829 | 12.5 | 7.87 | 0.069959 | 0.524 | 6.012 | 66.6 | 5.5605 | 5 | 311 | 15.2 | 395.60 | 12.430000 | 22.9 |
| 7 | 0.14455 | 12.5 | 7.87 | 0.000000 | 0.524 | 6.172 | 96.1 | 5.9505 | 5 | 311 | 15.2 | 396.90 | 19.150000 | 27.1 |
| 8 | 0.21124 | 12.5 | 7.87 | 0.000000 | 0.524 | 5.631 | 100.0 | 6.0821 | 5 | 311 | 15.2 | 386.63 | 29.930000 | 16.5 |
| 9 | 0.17004 | 12.5 | 7.87 | 0.069959 | 0.524 | 6.004 | 85.9 | 6.5921 | 5 | 311 | 15.2 | 386.71 | 17.100000 | 18.9 |
Last rows
| | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | MEDV |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 496 | 0.28960 | 0.0 | 9.69 | 0.0 | 0.585 | 5.390 | 72.900000 | 2.7986 | 6 | 391 | 19.2 | 396.90 | 21.140000 | 19.7 |
| 497 | 0.26838 | 0.0 | 9.69 | 0.0 | 0.585 | 5.794 | 70.600000 | 2.8927 | 6 | 391 | 19.2 | 396.90 | 14.100000 | 18.3 |
| 498 | 0.23912 | 0.0 | 9.69 | 0.0 | 0.585 | 6.019 | 65.300000 | 2.4091 | 6 | 391 | 19.2 | 396.90 | 12.920000 | 21.2 |
| 499 | 0.17783 | 0.0 | 9.69 | 0.0 | 0.585 | 5.569 | 73.500000 | 2.3999 | 6 | 391 | 19.2 | 395.77 | 15.100000 | 17.5 |
| 500 | 0.22438 | 0.0 | 9.69 | 0.0 | 0.585 | 6.027 | 79.700000 | 2.4982 | 6 | 391 | 19.2 | 396.90 | 14.330000 | 16.8 |
| 501 | 0.06263 | 0.0 | 11.93 | 0.0 | 0.573 | 6.593 | 69.100000 | 2.4786 | 1 | 273 | 21.0 | 391.99 | 12.715432 | 22.4 |
| 502 | 0.04527 | 0.0 | 11.93 | 0.0 | 0.573 | 6.120 | 76.700000 | 2.2875 | 1 | 273 | 21.0 | 396.90 | 9.080000 | 20.6 |
| 503 | 0.06076 | 0.0 | 11.93 | 0.0 | 0.573 | 6.976 | 91.000000 | 2.1675 | 1 | 273 | 21.0 | 396.90 | 5.640000 | 23.9 |
| 504 | 0.10959 | 0.0 | 11.93 | 0.0 | 0.573 | 6.794 | 89.300000 | 2.3889 | 1 | 273 | 21.0 | 393.45 | 6.480000 | 22.0 |
| 505 | 0.04741 | 0.0 | 11.93 | 0.0 | 0.573 | 6.030 | 68.518519 | 2.5050 | 1 | 273 | 21.0 | 396.90 | 7.880000 | 11.9 |